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The invention relates to polypeptides comprising cytokine ligands wherein at least 
one binding site for said ligand's cognate receptor is disrupted; oligomers thereof and 
5 their use as pharmaceutical agents. 

A large group of growth factors, referred to as cytokines, are involved in a number of 
diverse cellular functions. These include, by example and not by way of limitation, 
modulation of the immune system, regulation of energy metabolism and control of 

10 growth and development. Cytokines mediate their effects via receptors expressed at 
the cell surface on target cells. Cytokine receptors can be divided into four separate 
sub groups. Type 1 (growth hormone (GH) family) receptors are characterised by 
four conserved cysteine residues in the amino terminal part of their extracellular 
domain and the presence of a conserved Trp-Ser-Xaa-Trp-Ser motif in the C- 

15 terminal part. The repeated Cys motif are also present in Type 2 (interferon family) 
and Type DI (tumour necrosis factor family). 

It is known that many cytokine ligands interact with their cognate receptor via 
specific sites. Some cytokine receptors have both high affinity ligand binding sites 
20 and low affinity binding sites. 

For example, it is known that a single molecule of GH associates with two receptor 
molecules (GHR) (Cunningham et al, 1991; de Vos et aL, 1992; Sundstrom et al, 
1996; Clackson et al. 9 1998). This occurs through two unique receptor-binding sites 

25 on GH and a common binding pocket on the extracellular domain of two receptors. 
Site 1 on the GH molecule has, a higher affinity than site 2, and receptor dimerization 
is thought to occur sequentially with one receptor binding to site 1 on GH followed 
by recruitment of a second receptor to site 2. The extracellular domain of the GHR 
exists as two linked domains each of approximately 100 amino acids. It is a 

30 conformational change in these two domains that occurs on hormone binding with 
the formation of the trimeric complex GHR-GH-GHR. Internalisation of the GHR- 
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GH-GHR complex is followed by a recycling step whereby the receptor molecule is 
regenerated for further use within the cell. 

A variety of different stoichiometrics are employed by different cytokines and other 
5 ligands on receptor binding. Thus erythropoetin, like GH, forms a trimeric receptor- 
hormone-receptor complex. Interleukin-4 forms a trimeric receptor-hormone- 
different receptor complex. Other cytokines, for example leptin and GCSF, form 
tetrameric receptor-hormone-hormone-receptor complexes, and others (eg interleukin 
6) probably form hexameric complexes consisiting of two soluble receptor 
10 molecules, two transmembrane receptor molecules and two cytokine molecules. In 
each case there is a primary high affinity binding site that locates the cytokine to the 
receptor complex, and additional sites which play secondary roles in altering the 
conformation or recruiting other molecules and thereby initiating signalling. 

15 Variant cytokine polypeptides are known. For example, GH variants are disclosed in 
US 5, 849, 535. The modification to GH is at both site 1 and site 2 binding sites. 
The modifications to site 1 produce a GH molecule that has a higher affinity for GHR 
compared to wild-type GH. These modified GH molecules act as agonists. There is 
also disclosure of site 2 modifications that result in the creation of GH antagonists. 

20 Further examples of modifications to GH which alter the binding affinity of GH for 
site 1 are disclosed in US 5,854,026; US 6,004,931; US6,022,71 1; US6,057,292; and 
US6136563. These modifications relate to point mutations at specific positions in 
GH which produce a molecule with altered signalling properties. 

25 Circular permutation is a means to generate polypeptide variants that retain the 
overall tertiary structure of a native polypeptide but re-orders the primary linear 
sequence by forming new amino and carboxyl termini. The process generates 
molecules with altered biological properties. The process includes the fusion of the 
natural amino and carboxyl termini either directly or by using linker molecules that 

30 are typically peptide linkers. The conceptually circularised molecule is then cut to 
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generate new amino and carboxyl termini. Circularly permuted polypeptides can be 
generated either recombinantly or by in vitro peptide synthesis. 

Circular permutation has been used to generate chimeric molecules with altered 
5 biological activity. 

For instance, W095/27732 discloses the creation of a circularly permuted IL-4 ligand 
fused to a cytotoxic agent. The permuted IL-4-agent has altered affinity and 
cytotoxicity when compared to a native IL-4-agent and has efficacy with respect to 
10 killing cancer cells which are exposed to the conjugated polypeptide. 

W099/51632 describes the use of circular permutation to generate novel streptavidin 
binding proteins that have reduced affinity for biotin. The circularly permuted 
streptavidin is fused to a second polypeptide to create a fusion protein that 
15 differentially binds biotin. The reduced affinity of the strepavidin fusion protein for 
biotin facilitates release of the fusion protein when biotin is used as a drug delivery 
vehicle. 

WO01/51629 discloses circularly permuted bacterial P-lactamase and its use as a 
20 marker protein for the detection of interactions between intracellular and extracellular 
proteins that assemble with the permuted polypeptide. 

Methods to identify circularly permuted polypeptides are also known. For example; 
WO00/ 18905, which is incorporated by reference, describes a method to identify 
25 permuted polypeptides, referred to as "permuteins", using a phage display vector into 
which a library of permuted genes are inserted. The expression of the library at the 
surface of the display vector is detected by exposure of the expressed library to a 
binding protein that potentially interacts with a permutein. 

30 WOO 1/30998, which is incorporated by reference, discloses a further method to 
generate and identify circularly permuted proteins. The invention relates to the 
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formation of fusion proteins comprising the amino terminal part of a first protein 
fused to the carboxyl terminal part of a different second protein from which 
peimuteins are synthesised. A library of fusion proteins is created which can be 
screened by phage display. 

5 

We have applied the process of circular permutation to cytokine ligands to generate 
polypeptide variants that are "gapped" at binding sites for binding partners and/ or 
their receptors to produce ligands with altered properties (antagonists or agonists). 
Ligands that interact with receptors to bring about a suitable biochemical response 
10 are known as agonists and those that prevent, or hinder, a biochemical response are 
known as antagonists. For example, and not by way of limitation, cell specific growth 
factors are ligands that act as agonists and bind receptors located in cell membranes 
to activate cell division, growth or differentiation. 

15 As an illustration of the technique, we have generated a series of circularly permuted 
GH constructs in which the formation of new amino and carboxyl terminal termini is 
localised to site 2 of GH thereby disrupting the low affinity binding site for ligand 
binding to the GHR. This therefore allows docking of GH via its high affinity site 1 
domain but produces a complex which is incapable of activating GHR. 

20 

We also disclose oligomers (tandems, trimers, etc) of said permuted polypeptides that 
have additional properties, for example delayed clearance from the circulatory 
system. The in vivo efficacy of many cytokines, for example GH, is determined in 
part by the affinity for GHR and rate of clearance from the circulation. Permuted 
25 polypeptides may also be fused to the extracellular binding domain of their cognate 
receptors). For example, we describe in PCT/GB0 1/02645; WO01/096565, the 
fusion of ligand binding domains to' their cognate receptors (for example the fusion 
of GH to GHR via linker molecules) that retain biological activity and have the 
advantageous property of delayed clearance. 
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The kidneys are relatively small organs that receive approximately 25% of cardiac 
output. The kidneys perform several important functions primarily related to the 
regulation of the composition and volume of body fluids. The kidneys filter about 
100 litres of plasma every day and of the blood flow in and out of a kidney only 
5 approximately 1% becomes urine. Approximately 20% of the plasma that passes 
through the kidney gets filtered into the nephron. Filtration takes place, in the 
glomerulus that is driven by the hydrostatic pressure of the blood. Water and small 
molecules are filtered whereas blood cells and large molecules, for example 
polypeptides, do not pass through the glomerular filter. Those polypeptides with an 

10 effective molecular weight above 70 kDa are not cleared by glomerular filtration 
because they are simply too large to be filtered. Certain proteins of small molecular 
weight are filtered by the glomerulus and are found in the urine. For example, GH 
has a molecular weight of 22. 1 kDa and the kidney is responsible for clearing up to 
60-70% of GH in humans (Baumann, 1991; Haffiier et al, 1994), and up to 67% in rat 

15 (Johnson & Maack, 1977). Other examples of relatively small molecular weight 
polypeptides that are filtered by the kidney include leptin, erythropoeitin, and EL-6. 

According to an aspect of the invention there is provided a modified ligand 
polypeptide comprising a modified amino acid sequence which is a modification of 
20 the native amino acid sequence of said ligand, wherein the native amino terminal and 
carboxyl terminal amino acid residues of the native polypeptide are linked, directly or 
indirectly, together, characterised in that said ligand is provided with alternative 
amino terminal and carboxyl terminal amino acid residues and further wherein at 
least one binding domain for said ligand's cognate binding partner is disrupted. 

25 

According to an aspect of the invention there is provided a modified cytokine ligand 
polypeptide comprising a modified amino acid sequence which is a modification of 
the native cytokine amino acid sequence of said ligand, wherein the native amino 
terminal and carboxyl terminal amino acid residues of the native polypeptide are 
30 linked, directly or indirectly, together, characterised in that said ligand is provided 
with alternative amino terminal and carboxyl terminal amino acid residues and 
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further wherein at least one binding domain for said ligand's binding partner is 
disrupted. 

In a preferred embodiment of the invention said native cytokine ligand is selected 
5 from the group consisting of: growth hormone; leptin; erythropoietin; prolactin; 
tumour necrosis factor (TNF), interleukins (IL), IL-2, IL-3, IL-4, IL-5, IL-6, EL-7, IL- 
9, DL-10, EL-11; the p35 subunit of IH2, IL-13, D-,-15; granulocyte colony 
stimulating factor (G-CSF); granulocyte macrophage colony stimulating factor (GM- 
CSF); ciliary neurotrophic factor (CNTF); cardiotrophin-1 (CT-1); leukemia 
10 inhibitory factor (LDF); oncostatin M (OSM); interferon, IFNa and IFNy, 
osteoprotogerin (OPG). 

In a further preferred embodiment of the invention said binding partner is a receptor 
for said ligand. 

15 

In an alternative preferred embodiment of the invention said binding partner is a 
second ligand which forms a complex with said ligand and/or said receptor. 

In a preferred embodiment of the invention said cytokine ligand is growth hormone. 

20 

In a further preferred embodiment of the invention said native amino terminal and 
carboxyl terminal amino acid residues are directly linked to each other. 

In an alternative preferred embodiment of the invention said native amino terminal 
25 and carboxyl terminal amino acid residues are indirectly linked by a linking 
molecule. Preferably said linking molecule is a peptide linkage. 

In a preferred embodiment of the invention said linking peptide is a flexible peptide 
linker. 

30 
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Preferably the flexible linker is a polypeptide that comprises 5 to 30 amino acid 
residues. More preferably the linker comprises 10 to 20 amino acid residues. More 
preferably still the linker comprises at least one copy of the peptide: 

5 Gly Gly Gly Gly Ser (hereinafter referred to as "GljrtSer")- 

In one embodiment of the invention the linker is 10 amino acids in length and 
comprises two copies of the Gly4Ser linker. In an alternative embodiment of the 
invention, the linker is 15 amino acids in length and comprises three copies of the 
10 Gly4Ser linker. In yet an alternative embodiment, the linker is 20 amino acids in 
length and comprises four copies of the Gly4Ser linker. 

In an alternative embodiment of the invention said linker is an inflexible linker, for 
example a linker wherein said linker is, over part of its length, has a a-helical region. 

15 

For example, Arai et al {Protein Eng 14(8): 529-532 (2001), which is incorporated 
by reference, investigated the use of helix-forming peptides A(EAAAK) n A to 
separate domains of different green fluorescent proteins and measured the separation 
of the proteins using FRET, showing such linkers to behave as rigid entities of fixed 
20 length. JP2002247997, which is incorporated by reference, utilises an a-helical 
linker sequence to link IL-6 ligand to IL-6 receptor. It is envisaged that linkers that 
have properties between the extremes of a flexible linker and a helical linker may 
also be utilised to link respective native ends of cytokine ligands. 

25 In a further preferred embodiment of the invention said receptor binding domain of 
said ligand comprises a low affinity bind site. 

In a preferred embodiment of the invention said low affinity binding domain is site 2 
of growth hormone. 

30 
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In a further preferred embodiment of the invention said low affinity binding domain 
of growth hormone is between about amino acid 116 - amino acid 122 of human 
growth hormone as represented by the amino acid sequence shown in Figure 1. 

5 In a further preferred embodiment of the invention said alternative amino terminal 
and carboxyl terminal amino acid residues are derived from between amino acid 116 
and amino acid 122 of human growth hormone as represented by Figure 1. 

In a further preferred embodiment of the invention said alternative amino terminal 
10 and carboxyl terminal amino acid residues are derived from between amino acid 1 18 
and amino acid 121 of human growth hormone as represented by Figure 1. 

In a further preferred embodiment of the invention said alternative amino terminal 
15 and carboxyl terminal amino acid residues are derived from between amino acid 119 
and amino acid 121 of human growth hormone as represented by Figure 1. 

In a further preferred embodiment of the invention said alternative amino terminal 
and carboxyl terminal amino acid residues are derived from between amino acid 120 
20 and amino acid 121 of human growth hormone as represented by Figure 1 . 

In a further preferred embodiment of the invention said alternative amino terminal 
and carboxyl terminal amino acid residues are derived from between amino acid 118 
and amino acid 120 of human growth hormone as represented by Figure 1. 

25 

In a further preferred embodiment of the invention said alternative amino terminal 
and carboxyl terminal amino acid residues are derived from between amino acid 119 
and amino acid 120 of human growth hormone as represented by Figure 1. 

30 

In an alternative preferred embodiment of the invention said alternative amino 
terminal and carboxyl terminal amino acid residues are derived from between about 
amino acid 100 and amino acid 102 of human growth hormone as represented by the 
amino acid sequence shown in Figure 1 
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In a further alternative preferred embodiment of the invention alternative amino 
terminal and carboxyl terminal amino acid residues are derived from between about 
amino acid 130 - amino acid 132 of human growth hormone as represented by the 
5 amino acid sequence shown in Figure 1 

In a further preferred embodiment of the invention there is provided an oligomeric 
cytokine ligand polypeptide comprising at least two modified cytokine ligand 
polypeptides according to the invention wherein said ligands are linked, either 
10 directly or indirectly, together. 

In a preferred embodiment of the invention said ligands are linked by a flexible 
peptide linker molecule. In an alternative preferred embodiment of the invention said 
ligands are linked by an inflexible peptide linker molecule, preferably said linker 
1 5 molecule comprises a ct-helical region. 

In a preferred embodiment of the invention said oligomer comprises two modified 
cytokine ligand polypeptides. 

20 In a further preferred embodiment of the invention said said oligomer comprises, at 
least 3; 4; 5; 6; 7; 8; 9; or at least 10 modified cytokine ligand polypeptides. 

In a further preferred embodiment of the invention said oligomeric cytokine 
polypeptide comprises at least two modified growth hormone polypeptides as 
25 hereindescribed. Preferably said oligomeric growth hormone polypeptide comprises 
multiple ligand polypeptides. 

In an alternative preferred embodiment of the invention there is provided an 
oligomeric cytokine ligand polypeptide comprising at least one modified cytokine 
30 ligand polypeptide according to the invention linked, either directly or indirectly, to 
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at least one native cytokine Hgand polypeptide from which said modified cytokine 
ligand polypeptide was derived. 

In a further alternative embodiment of the invention there is provided a modified 
5 cytokine ligand polypeptide according to the invention linked to at least one 
extracellular ligand binding domain of said ligand' s cognate receptor. 

In our co-pending application, WO01/096565, which is incorporated by reference, 
we disclose fusion proteins which translationally fuse the ligzuid binding domain of a 

10 cytokine to the extracellular receptor binding domain of said ligand via flexible 
peptide linkers. These fusion proteins have delayed clearance and agonist activity. 
The fusion of cytokine to cognate receptor provides an immunologically silent 
polypeptide which has a molecular weight which slows renal clearance. Modified 
cytokine ligand polypeptides as hereindisclosed could also benefit from delayed 

1 5 clearance. 

Peptide linkers that link cytokine ligand polypeptides to one another to form 
oligomeric polypeptides (dimers, trimers etc) and to cognate extracellular receptor 
binding domains are either flexible (e.g. Gly4Ser) or inflexible (e.g. a-helical) or 
20 intermediate (e.g. a combinational linker which is part helical) as described above. 
Linkers may also contain cleavage sites to provide oligomeric polypeptides with 
delayed release characteristics. 

In a preferred embodiment of the invention said linker comprises a cleavage site, 
25 preferably a proteolytic cleavage site. 

Preferably said cleavage site is sensitive to a serum protease or a matrix 
metalloprotease. 

30 In a preferred embodiment of the invention said cleavage site comprises the amino 
acid sequence: LVPRGS, or variant thereof. 
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Preferably, said cleavage site comprises the amino acid sequence PGI(S), or variant 
thereof. 

5 More preferably still said cleavage site comprises the amino acid sequence: LVPRGS 
PGI, or variant thereof. 

Alternatively,' said cleavage site comprises at least two copies of the amino acid 
sequence GGGGS, or functional variant thereof, which flank said cleavage site. 

10. 

In a further preferred embodiment of the invention said cleavage site is sensitive to 
the serum protease thrombin. 

According to a further aspect of the invention there is provided a nucleic acid 
15 molecule which encodes a modified cytokine ligand polypeptide or an oligomeric 
modified cytokine ligand polypeptide according to the invention. 

According to a further aspect of the invention there is provided a vector comprising a 
nucleic acid molecule according to the invention. 

20 

In a preferred embodiment of the invention said vector is an expression vector 
adapted for eukaryotic gene expression. 

Typically said adaptation includes, the provision of transcription control sequences 
(promoter/enhancer sequences) which mediate cell/tissue specific expression. These 
promoter sequences maybe cell/tissue specific, inducible or constitutive. 

Promoter is an art recognised term and, for the sake of clarity, includes the following 
features which are provided by example only, and not by way of limitation. Enhancer 
elements are cis acting nucleic acid sequences often found 5* to the transcription 
initiation site of a gene (enhancers can also be found 3' to a gene sequence or even 
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located in intronic sequences and are therefore position independent). Enhancers 
function to increase the rate of transcription of the gene to which the enhancer is 
linked. Enhancer activity is responsive to trans acting transcription factors 
(polypeptides) which have been shown to bind specifically to enhancer elements. The 
5 binding/activity of transcription factors (please see Eukaryotic Transcription Factors, 
by David S Latchman, Academic Press Ltd, San Diego) is responsive to a number of 
environmental cues that include, by example and not by way of limitation, 
intermediary metabolites (e.g. glucose, lipids), environmental effectors (e.g. heat). 

10 Promoter elements also include so called TATA box and RNA polymerase initiation 
selection (RIS) sequences which function to select a site of transcription initiation. 
These sequences also bind polypeptides that function, inter alia, to facilitate 
transcription initiation selection by RNA polymerase. 

15 Adaptations also include the provision of selectable markers and autonomous 
replication sequences which both facilitate the maintenance of said vector in either 
the eukaryotic cell. Vectors which are maintained autonomously are referred to as 
episomal vectors. 

20 Adaptations which facilitate the expression of vector encoded genes include the 
provision of transcription termination/polyadenylation sequences. This also includes 
the provision of internal ribosome entry sites (IRES) that function to maximise 
expression of vector encoded genes arranged in bicistronic or multi-cistronic 
expression cassettes. 

25 

These adaptations are well known in the art. There is a significant amount of 
published literature with respect to expression vector construction and recombinant 
DNA techniques in general. Please see, Sambrook et al (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbour Laboratory, Cold Spring Harbour, NY and 
30 references therein; Marston, F (1987) DNA Cloning Techniques: A Practical 
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Approach Vol m IRL Press, Oxford UK; DNA Cloning: F M Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

It will be apparent to one skilled in the art that the vectors according to the invention 
5 could be gene therapy vectors. Gene therapy vectors are typically virus based. A 
number of viruses are commonly used as vectors for the delivery of exogenous genes. 
Commonly employed vectors include recombinantly modified enveloped or non- 
enveloped DNA and RNA viruses, preferably selected from baculoviridiae, 
parvoviridiae, picornoviridiae, herpesveridiae, poxviridae, adenoviridiae, or 
10 picornnaviridiae. Chimeric vectors may also be employed which exploit 
advantageous elements of each of the parent vector properties (See e.g., Feng, et 
al.(1997) Nature Biotechnology 15:866-870). Such viral vectors may be wild-type or 
may be modified by recombinant DNA techniques to be replication deficient, 
conditionally replicating or replication competent. 

15 

Preferred vectors are derived from the adenoviral, adeno-associated viral and 
retroviral genomes. In the most preferred practice of the invention, the vectors are 
derived from the human adenovirus genome. Particularly preferred vectors are 
derived from the human adenovirus serotypes 2 or 5. The replicative capacity of 
20 such vectors may be attenuated (to the point of being considered "replication 
deficient") by modifications or deletions in the El a and/or Elb coding regions. 
Other modifications to the viral genome to achieve particular expression 
characteristics or permit repeat administration or lower immune response are 
preferred. 

25 

Alternatively, the viral vectors may be conditionally replicating or replication 
competent. Conditionally replicating viral vectors are used to achieve selective 
expression in particular cell types while avoiding untoward broad spectrum infection. 
Examples of conditionally replicating vectors are described in Pennisi, E. (1996) 
30 Science 274:342-343; Russell, and S.J. (1994) Eur. J. of Cancer 30A(8): 1165-1171. 
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Additional examples of selectively replicating vectors include those vectors wherein 
a gene essential for replication of the virus is under control of a promoter which is 
active only in a particular cell type or cell state such that in the absence of expression 
of such gene, the virus will not replicate. Examples of such vectors are described in 
5 Henderson, et aL, United States Patent No. 5,698,443 issued December 16, 1997 and 
Henderson, et aL, United States Patent No. 5,871,726 issued February 16, 1999 the 
entire teachings of which are herein incorporated by reference. Vectors may also be 
non- viral and are available from a number of commercial sources readily available to 
a person -skilled in the art. For example, the vectors may be plasmids that can be 
10 episomal or integrating. 

According to a yet further aspect of the invention there is provided a cell transfected 
or transformed with a nucleic acid molecule or vector according to the invention. 

15 In a preferred embodiment of the invention said cell is a eukaryotic cell. Preferably 
said cell is selected from the group consisting of: a mammalian cell (e.g. Chinese 
Hamster Ovary cell); yeast cells (e.g. Saccharomyces spp, Pichia spp); insect cells 
(e.g. Spodoptera spp) or plant cells. 

20 According to a yet further aspect of the invention there is provided a non-human 
transgenic mammal transfected/transformed with the nucleic acid molecule or vector 
according to the invention. 

According to a yet further aspect of the invention there is provide a modified 
25 cytokine ligand polypeptide, an oligomeric modified cytokine ligand polypeptide, a 
nucleic acid molecule, a vector or a cell according to the invention for use as a 
pharmaceutical. 

Preferably said ligand is an antagonist. In an alternative embodiment of the invention 
30 said ligand is an agonist. 
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Preferably said polypeptide, nucleic acid molecule, vector or cell is used in a 
pharmaceutical composition. 

When administered the pharmaceutical composition of the present invention is 
administered in pharmaceutically acceptable preparations. Such preparations may 
5 routinely contain pharmaceutically acceptable concentrations of salt, buffering 
agents, preservatives, compatible carriers, and optionally other therapeutic agents. 

The pharmaceutical composition of the invention can be administered by any 
conventional route, including injection. The administration and application may, for 
10 example, be oral, intravenous, intraperitoneal, intramuscular, intracavity, intra- 
articuar, subcutaneous, topical (eyes), dermal (e.g a cream lipid soluble insert into 
skin or mucus membrane), transdermal, or intranasal. 

Pharmaceutical composition of the invention is administered in effective amounts. 

15 An "effective amount" is that amount of a pharmaceutical/composition that alone, or 
together with further doses or synergistic drugs, produces the desired response. This 
may involve only slowing the progression of the disease temporarily, although more 
preferably, it involves halting the progression of the disease permanently. This can 
be monitored by routine methods or can be monitored according to diagnostic 

20 methods. 

The doses of the pharmaceutical composition administered to a subject can be chosen 
in accordance with different parameters, in particular in accordance with the mode of 
administration used and the state of the subject (i.e. age, sex). Wftien administered, 

25 the pharmaceutical composition of the invention is applied in pharmaceutically- 
acceptable amounts and in pharmaceutically-acceptable compositions. Such 
preparations may routinely contain salts, buffering agents, preservatives, compatible 
carriers, and optionally other therapeutic agents. When used in medicine, the salts 
should be pharmaceutically acceptable, but non-pharmaceutically acceptable salts 

30 may conveniently be used to prepare pharmaceutically-acceptable salts thereof and 
are not excluded from the scope of the invention. Such pharmacologically and 
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pharmaceutically-acceptable salts include, but are not limited to, those prepared from 
the following acids: hydrochloric, hydrobromic, sulfuric, nitric, phosphoric, maleic, 
acetic, salicylic, citric, formic, malonic, succinic, and the like. Also, 
pharmaceutically-acceptable salts can be prepared as alkaline metal or alkaline earth 
5 salts, such as sodium, potassium or calcium salts. 

The pharmaceutical composition may be combined, if desired, with a 
pharmaceutically-acceptable carrier. The term "pharmaceutically-acceptable carrier" 
as used herein means one or more compatible solid or liquid fillers, diluents or 

10 encapsulating substances that are suitable for administration into a human. The term 
"carrier" denotes an organic or inorganic ingredient, natural or synthetic, with which 
the active ingredient is combined to facilitate the application. The components of the 
pharmaceutical compositions also are capable of being co-mingled with the 
molecules of the present invention, and with each other, in a manner such that there 

15 is no interaction that would substantially impair the desired pharmaceutical efficacy. 

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a 
salt. 

20 

The pharmaceutical compositions also may contain, optionally, suitable 
preservatives, such as: benzalkonium chloride; chlorobutanol; parabens and 
thimerosal. 

25 The pharmaceutical compositions may conveniently be presented in unit dosage form 
and may be prepared by any of the methods well-known in the art of pharmacy. All 
methods include the step of bringing the active agent into association with a carrier 
that constitutes one or more accessory ingredients. In general, the compositions are 
prepared by uniformly and intimately bringing the active compound into association 

30 with a liquid carrier, a finely divided solid carrier, or both, and then, if necessary, 
shaping the product. 
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Compositions suitable for oral administration may be presented as discrete units, 
such as capsules, tablets, lozenges, each containing a predetermined amount of the 
active compound. Other compositions include suspensions in aqueous liquids or 
5 non-aqueous liquids such as a syrup, elixir or an emulsion. 

Compositions suitable for parenteral administration conveniently comprise a sterile 
aqueous or non-aqueous preparation that is preferably isotonic with the blood of the 
recipient. This preparation may be formulated according to known methods using 
10 suitable dispersing or wetting agents and suspending agents. The sterile injectable 
preparation also may be a sterile injectable solution or suspension in a non-toxic 
parenterally-acceptable diluent or solvent, for example, as a solution in 1,3-butane 
diol. Among the acceptable solvents that may be employed are water, Ringer's 
solution, and isotonic sodium chloride solution. In addition, sterile, fixed oils are 
15 conventionally employed as a solvent or suspending medium. For this purpose any 
bland fixed oil may be employed including synthetic mono-or di-glycerides. In 
addition, fatty acids such as oleic acid may be used in the preparation of injectables. 
Carrier formulation suitable for oral, subcutaneous, intravenous, intramuscular, etc. 
administrations can be found in Remington's Pharmaceutical Sciences, Mack 
. 20 Publishing Co., Easton, PA. 

Polypeptides/nucleic acid molecules etc according to the invention can be 
incorporated into liposomes. Liposomes are lipid based vesicles which encapsulate a 
selected therapeutic agent which is then introduced into a patient. The liposome is 
manufactured either from pure phospholipid or a mixture of phospholipid and 

25 phosphoglyceride. Typically liposomes can be manufactured with diameters of less 
than 200nm, this enables them to be intravenously injected and able to pass through 
the pulmonary capillary bed. Furthermore the biochemical nature of liposomes 
confers permeability across blood vessel membranes to gain access to selected 
tissues. Liposomes do have a relatively short half-life. So called STEALTH R 

30 liposomes have been developed which comprise liposomes coated in polyethylene 
glycol (PEG). The PEG treated liposomes have a significantly increased half-life 
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when administered intravenously to a patient. In addition, STEALTH R liposomes 
show reduced uptake in the reticuloendothelial system and enhanced accumulation 
selected tissues. In addition, so called immuno-liposomes have been develop which 
combine lipid based vesicles with an antibody or antibodies, to increase the 
5 specificity of the delivery of the agent to a selected cell/tissue. 

The use of liposomes as delivery means is described in US 5580575 and US 
5542935. 



10 According to a further aspect of the invention there is provided a screening method to 
generate modified cytokine ligand polypeptides according to the invention 
comprising the steps of: 

i) forming a preparation comprising native cytokine ligand polypeptide 
molecules wherein the native amino terminal and carboxyl terminal 

15 amino acids are linked either directly or indirectly together, 

ii) generating modified cytokine ligand polypeptide molecules wherein 
said molecules have alternative amino terminal and carboxyl terminal 
amino acids; and 

iii) testing the activity of said modified cytokine ligand polypeptides. 



20 



25 



In a preferred method of the invention said native cytokine is growth hormone. 

Bioassays to test the activity of, for example growth hormone are known in the art 
and are disclosed in WOO 1/096565. 

According to a further aspect of the invention there is provided a modified cytokine 
ligand polypeptide identified by the method according to the invention. 



In a preferred embodiment of the invention said modified cytokine ligand 
30 polypeptide is growth hormone. 
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According to a yet further aspect of the invention there is provided a method of 
treatment of an animal, preferably a human, comprising administering an effective 
amount of a nucleic acid and/or vector and/or polypeptide and /or cell according to 
the invention. 

5 An embodiment of the invention will now be described by example only and with 
reference to the following figures: 

Figure 1 shows the amino acid sequence of human GH with binding site for GHR 
indicated with arrows; 

10 

Figure 2 shows the nucleic acid sequence of human GH; 

Figure 3 shows the amino acid sequence of GHR (extracellular domain underlined); 

15 Figure 4 Cloning strategy for the circular permutation of Growth Hormone. In the 
first PCR reaction a forward primer (GHJDPFor) and a linker primer (GH_CPLink) 
are used to produce a 'megaprimer'. This megaprimer is used with a reverse primer 
(GH_CPRev) to generate the circularly permutated GH gene. Appropriate restriction 
sites [BamHl (B) and Notl (N)] are engineered into the forward and reverse primers 

20 to facilitate ligation into the vector pTrcHis-TOPO; 

Figure 5: Schematic and DNA sequence showing the strategy used to generate 
GH_CP01, (A) a schematic diagram showing how GH is transformed into 
GH_CP01; Glul20 (grey disc) is removed by initiating the new gene at residue 121 

25 and terminating the protein at residue 118, the 'old' termini are linked by joining the 
termini to make a 6 amino acid linker. The helices are numbered in order (from N to 
C terminus) and the arrows denote the direction of the helices (from N to C). (B) the 
DNA sequence of GH and GH_CP01; the nucleotides removed from GH to produce 
GH_CP01 are underlined, the initiation nucleotide for GH_CP01 in GH, and vice 

30 versa, are shown in bold; 
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Figure 6: Western blots showing the expression of GH_CP01 in three different 
systems. The blots show that GH_CP01 is expressed and detected by the antibody 
probes used in the western blot system utilised. TO shows expression at the time of 
induction, T4 shows expression 4 hours post-induction, WP shows whole protein in 
5 the RTS reaction and SP shows j>nly soluble protein in the RTS reaction. Protein 
produced from linear template lacks N-terminal His tag and adapter sequence and 
hence migrates slightly faster; 

Figure 7 is a diagrammatic representation of circularly permuted GH molecules; 

10 

Figure 8 illustrates expression of CP02 protein in E. coli. The induced band of CP02 
protein is denoted by the arrowhead. Lane 1: BioRad protein marker; Lane 2: Non- 
induced cells carrying GHCP02 construct in LB media; Lane 3: Induced cells 
carrying GHCP02 construct; 

15 

Table 1 Primers used to generate GH_CP01. The bold characters denote sequence 
that anneals to the GH gene; the underlined characters denote endonuclease 
restriction sites (BamVLI - ggatcc; NotI - gcggccgc). In the linker primer, 
GH_CP01Link, the sequence that anneals to the carboxyl terminus of GH is shown in 
20 UPPER-CASE; and 

Table 2 Primers used to generate further circularly permuted GH molecules. 

Materials and Methods 

25 

Growth hormone was circularly permutated by using a 2-step PCR methodology. The 
first PCR reaction generated a DNA fragment encoding the new amino terminus to 
the end of the GH gene and an overhang which could anneal to the start of the GH 
gene. This PCR product was then used as a 'megaprimer' in another PCR reaction to 
30 generate the full length GH_CP gene. The relevant primers were designed with 
restriction digestion sites for ligation into an appropriate vector (Figure 4). 
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The GH_CP gene was ligated into an expression vector and this transformed into an 
appropriate strain of E. coli. Expression of GH_CP was confirmed by western blot of 
protein from induced cultures of E. coli containing the GH_CP expression plasmid, 
5 the blots were probed using mouse anti-GH antibodies (10A7, mouse IgGl) and 
Sheep anti-mouse-HRP antibodies (Amersham). 

The GH_CP protein was purified from cell lysates using a metal chelate affinity 
column (Probond resin, Invitrogen) followed by an ion exchange column (MonoQ, 
10 Pharmacia). 

Generation of the GH CP01 gene 

The first embodiment of the growth hormone circular permutation is GH_CP01. The 
15 amino terminus of this construct initiated at residue Del 21 of GH and the carboxy 
terminus was at residue Glull8 of.GH. The 'old' termini of GH were linked by a 6 
amino acid linker, which was formed by joining the 'old' termini -3 amino acids 
from the first helix at the amino terminus and +3 residues from the last helix at the 
carboxy terminus (Figure 5) 

20 

Two PCR reactions were required to generate the circularly permutated GH gene. 
Three primers were designed (Table 1), GH_CP01For consisted of a BarriKl 
restriction endonuclease site followed by the DNA sequence of the new amino 
terminus of the permutated GH; GH_CP01Rev was an antisense primer consisting of 
25 the DNA sequence of the new carboxy terminus of the permutated GH and a Noil 
restriction endonuclease site; GH_CP01Link was an antisense primer designed to 
anneal to both termini of GH. 

The primers GH_CP01For and GHCPOlLink and the template pTrcHisGHstop were 
30 used in the first PCR reaction to produce a *megaprimer\ This megaprimer was 
purified from the PCR reaction and then used in conjunction with GHCPOlRev, 



21 



WO 2005/003165 PCT/GB2004/002827 

again using pTrcHisGHstop as the template, to produce the full length gene for the 
circularly permutated GH (GH_CP01). This gene was then ligated into the pTrcHis- 
TOPO plasmid vector between the BamBI and Noil restriction endonuclease sites. 
The vector was then transformed into E. coli XL1 Blue cells. The success of- the 
5 circular permutation of GH was confirmed by DNA sequencing of the 
pTrcHis.GH_CP01 plasmid. 



10 

Expression of the GH CP 

A variety of expression systems were used to optimise the production of GH_CP01. 
Expression was carried out from the E. coli XL1 Blue:pTrcHis.GH_CP01 cells 

15 induced with IPTG. The GH_CP01 gene was also subcloned into the pHEAT vector 
and the resulting E. coli M72:pHEAT.GH_CP01 cells induced by thermal regulation. 
Cell free in vitro translation was also used for protein expression; both linear 
template and the gene subcloned into the pIVEX-23MCS vector were used to express 
GH_CP01 in the RTS system (Roche). All these systems produced soluble protein 

20 (Figure 6). 

Purification of GH CP 

25 GH_CP01 protein was purified from £. coli XL1 Blue:pTrcHis.GH_CP01 . The cells 
were harvested from the induced growth cultures were resuspended in Resuspension 
Buffer (20mM sodium phosphate buffer, 500mM sodium chloride, 5% glycerol, 
25jig/ml PMSF, pH 7.8) were lysed by a 30 minute incubation on ice after the 
addition of 100|ig/ml (final concentration) hen egg white lysozyme, 250^g/ml (final 

30 concentration) of sodium deoxycholate was then added and the solution incubated 
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for a further 30 minutes on ice, the lysed cells were then sonicated. Insoluble 
material was removed by centrifugation at 19000rpm for 30 minutes. 

The cleared cell lysate was applied to a 5ml Probond resin column (Invitrogen) 
5 which had been charged with Co 2+ and equilibrated with Equilibration Buffer 
(20mM sodium phosphate buffer, 500mM sodium chloride, 5% glycerol, pH 7.8), 
after loading the protein sample the column was washed with a further 10-20ml of 
Equilibration Buffer. The column was then washed with Wash Buffer (20mM 
sodium phosphate buffer, 500mM sodium chloride, 5% glycerol, pH 6.0) until the 
10 OD 2 go of the eluate <0.01 . Bound protein was eluted using 5ml Elution Buffer (Wash 
Buffer containing 500mM imidazole, pH 6.0). 

The protein was dialysed overnight against Low Salt Buffer (25mM TRIS, ImM 
EDTA, 5% glycerol, pH 8.0) and then centrifuged to remove any particulate matter. 

15 The protein sample was then loaded onto a Mono-Q column (Pharmacia), which had 
been pre-equilibrated with Low Salt Buffer. After a 10 column volume wash with 
Low Salt Buffer, the bound proteins were eluted over 20 column volumes using a 
gradient between 0M sodium chloride to 1M sodium chloride (in 25mM TRIS, ImM 
EDTA, 5% glycerol, pH8.0). Peaks on the elution profile were analysed by SDS- 

20 PAGE and western blotting. 

GH_CP protein was then concentrated (if required) using an Amicon Centriprep Y- 
10 column. 

25 The purity of the purified GH_CP01 was confirmed by SDS-PAGE, by both 
coomassie staining and western blot (results not shown). Once the integrity of this 
sample had been confirmed, GH_CP01 was submitted to the previously established 
bioassay (Ross et aL> 1997). 

30 Example 1 

Alternative approach to constructing circular permutations by way of example: 
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The circular permuted hGH described earlier 

IQTLMGRLED GSPRTGQIFK QTYSKFDTNS HNDDALLKNY GLLYCFRKDM 
DKVETFLRIV QCRSVEGSTI PLSRLFDNAS LRAHRLHQLA FDTYQEFEEA 
5 YIPKEQKYSF LQNPQTSLCF SESIPTPSNR EETQQKSNLE LLRISLLLIQ 
S WLEPVQFLR SVFANSLVYG ASDSNVYDLL KDLE 

can be encoded by a synthetic gene taking account of optimal codon usage for the 
expression system. Synthetic genes are readily produced by total gene synthesis. For 
10 example the following sequence (producing CP_01) could be constructed for 
expression in E. coli under control of a suitable promoter (ribosomal binding site 
underlined: 



15 



1 GAGGACTTAAATTAAATA 





17 


Met He 


Gin 


Thr 


Leu 


Met 


Gly 


Arg 


Leu 


Glu 


Asp 


Gly 


Ser 


Pro 


Arg 


Thr 


Gly 


20 


19 


ATG ATC 


CAG 


ACC 


TTA 


ATG 


GGC 


CGC 


CTG 


GAA 


GAC 


GGT 


AGC 


CCG 


CGG 


ACG 


GGA 


34 


Gin He 


Phe 


Lys 


Gin 


Thr 


Tyr 


Ser 


Lys 


Phe 


Asp 


Thr 


Asn 


Ser 


His 


Asn 


Asp 




70 


CAA ATT 


TTC 


AAA 


CAG 


ACC 


TAT 


AGT 


AAA 


TTT 


GAT 


ACG 


AAC 


AGC 


CAT 


AAC 


GAC 




51 


Asp Ala 


Leu 


Leu 


Lys 


Asn 


Tyr 


Gly 


Leu 


Leu 


Tyr 


Cys 


Phe 


Arg 


Lys 


Asp 


Met 


25 


121 


GAT GCT 


CTA 


CTG 


AAA 


AAC 


TAT 


GGT 


CTG 


CTC 


TAC 


TGC 


TTC 


CGC 


AAG 


GAT 


ATG 




68 


Asp Lys 


Val 


Glu 


Thr 


Phe 


Leu 


Arg 


He 


Val 


Gin 


Cys 


Arg 


Ser 


Val 


Glu 


Gly 




172 


GAT AAA 


GTT 


GAA 


ACC 


TTT 


CTG 


CGC 


ATA 


GTG 


CAG 


TGT 


CGA 


TCT 


GTG 


GAG 


GGC 


30 




Ser Thr 


He 


Pro 


Leu 


Ser 


Arg 


Leu 


Phe 


Asp 


Asn 


Ala 


Ser 


Leu 


Arg 


Ala 


His 


85 






























223 


TCC ACT 


ATT 


CCT 


CTG 


TCC 


CGC 


TTG 


TTC 


GAC 


AAT 


GCC 


TCA 


TTA 


CGT 


GCA 


CAC 






Arg Leu 


His 


Gin 


Leu 


Ala 


Phe 


Asp 


Thr 


Tyr 


Gin 


Glu 


Phe 


Glu 


Glu 


Ala 


Tyr 


35 


102 
274 


AGA TTG 


CAT 


CAG 


CTT 


GCC 


TTT 


GAT 


ACG 


TAC 


CAG 


GAG 


TTT 


GAA 


GAA 


GCG 


TAT 


119 


He Pro 


Lys 


Glu 


Gin 


Lys 


Tyr 


Ser 


Phe 


Leu 


Gin 


Asn 


Pro 


Gin 


Thr 


Ser 


Leu 




325 


ATT CCG 


AAG 


GAG 


CAA 


AAA 


TAG 


TCT 


TTT 


CTG 


CAA 


AAT 


CCG 


CAG 


ACC 


TCG 


CTG 






Cys Phe 


Ser 


Glu 


Ser 


He 


Pro 


Thr 


Pro 


Ser 


Asn 


Arg 


Glu 


Glu 


Thr 


Gin 


Gin 


40 


136 
376 


TGC TTC 


AGT 


GAA 


AGC 


ATT 


CCG 


ACT 


CCA 


TCG 


AAC 


CGT 


GAG 


GAA 


AGA 


CAG 


CAG 






Lys Ser 


Asn 


Leu 


Glu 


Leu 


Leu 


Arg 


He 


Ser 


Leu 


Leu 


Leu 


He 


Gin 


Ser 


Trp 




153 
427 


AAA TCC 


AAT 


CTG 


GAA 


CTG 


CTT 


CGT 


ATC 


AGC 


TTA 


CTG 


CTC 


ATC 


CAA 


AGC 


TGG 


45 




Leu Glu 


Pro 


Val 


Gin 


Phe 


Leu 


Arg 


Ser 


Val 


Phe 


Ala 


Asn 


Ser 


Leu 


val 


Tyr 


170 
































478 


TTG GAA 


ccc 


GTC 


CAA 


TTC 


CTT 


CGT 


TCA 


GTG 


TTT 


GCG 


AAT 


AGT 


CTG 


GTT 


TAT 




185 


Gly Ala 


Ser 


Asp 


Ser 


Asn 


Val 


Tyr 


Asp 


Leu 


Leu 


Lys 


Asp 


Leu 


Glu 


Stop 




529 


GGG GCA 


AGT 


GAC 


TCT 


AAC 


GTC 


TAT 


GAT 


CTG 


CTG 


AAA 


GAT 


CTC 


GAATAA 





50 

This could be adapted to carry a suitable purification tag such as a polyhistidine tract 
incorporated at the C or N-terminus are required to aid in purification. 
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Example 2 

An alternative permutation CP_02 such as: 

5 MEIQTLMGRL EDGSPRTGQI FKQTYSKFDT NSHNDDALLK NYGLLYCFRK 
DMDKVETFLR IVQCRS VEGS TIPLSRLFDN ASLRAHRLHQ LAFDTYQEFE 
EAYIPKEQKY SFLQNPQTSL CFSESDPTPS NREETQQKSN LELLRISLLL 
IQS WLEPVQF LRS VFANSLV YGASDSNVYD LLKDL 
in which the break is made one residue earlier than in CP01 

10 

CP_02 is encoded by : 

1 GAGGACTTAAATTAAATA 



1 c 

15 




Met Glu 


He 


Gin 


Thr 


Leu 


Met 


Gly 


Arg 


Leu 


Glu 


Asp 


Gly 


Ser 


Pro 


Arg 


Thr 


17 




























19 


ATG GAA 


ATC 


CAG 


ACC 


TTA 


ATG 


GGC 


CGC 


CTG 


GAA 


GAC 


GGT 


AGC 


CCG 


CGG 


ACG 




34 


Gly Gin 


He 


Phe 


Lys 


Gin 


Thr 


Tyr 


Ser 


Lys 


Phe 


Asp 


Thr 


Asn 


Ser 


His 


Asn 


20 


70 


GGA CAA 


ATT 


TTC 


AAA 


CAG 


ACC 


TAT 


AGT 


AAA 


TTT 


GAT 


ACG 


AAC 


AGC 


CAT 


AAC 


51 


Asp Asp 


Ala 


Leu 


Leu 


Lys 


Asn 


Tyr 


Gly 


Leu 


Leu 


Tyr 


Cys 


Phe 


Arg 


Lys 


Asp 




121 


GAC GAT 


GOT 


CTA 


CTG 


AAA 


AAC 


TAT 


GGT 


CTG 


CTC 


TAC 


TGC 


TTC 


CGC 


AAG 


GAT 




68 


Met Asp 


Lys 


val 


Glu 


Thr 


Phe 


Leu 


Arg 


He 


Val 


Gin 


Cys 


Arg 


Ser 


Val 


Glu 


25 


172 


ATG GAT 


AAA 


GTT 


GAA 


ACC 


TTT 


CTG 


CGC 


ATA 


GTG 


CAG 


TGT 


CGA 


TCT 


GTG 


GAG 




85 


Gly Ser 


Thr 


He 


Pro 


Leu 


Ser 


Arg 


Leu 


Phe 


Asp 


Asn 


Ala 


Ser 


Leu 


Arg 


Ala 




223 


GGC TCC 


ACT 


ATT 


CCT 


CTG 


TCC 


CGC 


TTG 


TTC 


GAC 


AAT 


GCC 


TCA 


TTA 


CGT 


GCA 


30 




His Arg 


Leu 


His 


Gin 


Leu 


Ala 


Phe 


Asp 


Thr 


Tyr 


Gin 


Glu 


Phe 


Glu 


Glu 


Ala 


102 
































274 


CAC AGA 


TTG 


CAT 


CAG 


CTT 


GCC 


TTT 


GAT 


ACG 


TAC 


CAG 


GAG 


TTT 


GAA 


GAA 


GCG 




119 


Tyr lie 


Pro 


Lys 


Glu 


Gin 


Lys 


Tyr 


Ser 


Phe 


Leu 


Gin 


Asn 


Pro 


Gin 


Thr 


Ser 


35 


325 


TAT ATT 


CCG 


AAG 


GAG 


CAA 


AAA 


TAC 


TCT 


TTT 


CTG 


CAA 


AAT 


CCG 


CAG 


ACC 


TCG 


136 


Leu Cys 


Phe 


Ser 


Glu 


Ser 


He 


Pro 


Thr 


Pro 


Ser 


Asn 


Arg 


Glu 


Glu 


Thr 


Gin 




376 


CTG TGC 


TTC 


AGT 


GAA 


AGC 


ATT 


CCG 


ACT 


CCA 


TCG 


AAC 


CGT 


GAG 


GAA 


ACA 


CAG 




153 


Gin Lys 


Ser 


Asn 


Leu 


Glu 


Leu 


Leu 


Arg 


He 


Ser 


Leu 


Leu 


Leu 


lie 


Gin 


Ser 


40 


427 


CAG AAA 


TCC 


AAT 


CTG 


GAA 


CTG 


CTT 


CGT 


ATC 


AGC 


TTA 


CTG 


CTC 


ATC 


CAA 


AGC 




170 


Trp Leu 


Glu 


Pro 


Val 


Gin 


Phe 


Leu 


Arg 


Ser 


Val 


Phe 


Ala 


Asn 


Ser 


Leu 


Val 




478 


TGG TTG 


GAA 


CCC 


GTC 


CAA 


TTC 


CTT 


CGT 


TCA 


GTG 


TTT 


GCG 


AAT 


AGT 


CTG 


GTT 


45 




Tyr Gly 


Ala 


Ser 


Asp 


Ser 


Asn 


Val 


Tyr 


Asp 


Leu 


Leu 


Lys 


Asp 


Leu 


Stop 


185 






























529 


TAT GGG 


GCA 


AGT 


GAC 


TCT 


AAC 


GTC 


TAT 


GAT 


CTG 


CTG 


AAA 


GAT 


CTC 


TAA 





SO Example 3 

A further alternative, CP_03 

MQTLMGRLED GSPRTGQIFK QTYSKFDTNS HNDDALLKNY 
55 GLLYCFRKDM DKVETFLRTV QCRSVEGSTI PLSRLFDNAS LRAHRLHQLA 
FDTYQEFEEA YIPKEQKYSF LQNPQTSLCF SESIPTPSNR EETQQKSNLE 
LLRISLLLIQ SWLEPVQFLR SVFANSLVYG ASDSNVYDLL KDLE 
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in which the initial I in CP01 is removed and replaced by the M 
Is encoded by : 

1 GAGGACTTAAATTAAATA 





Met 


Gin 


Thr 


Leu 


Met Gly 


Arg 


Leu 


Glu 


Asp 


Gly 


Ser 


Pro 


Arg 


Thr 


Gly 


Gin 


17 

19 


ATG 


CAG 


ACC 


TTA 


ATG 


GGC 


CGC 


CTG 


GAA 


GAC 


GGT 


AGC 


CCG 


CGG 


ACG 


GGA 


CAA 




lie 


Phe. 


Lys Gin 


Thr Tyr 


Ser 


Lys 


Phe 


Asp 


Thr 


Asn 


Ser 


His 


Asn 


Asp* 


Asp 


34 

70 


ATT 


TTC 


AAA 


CAG 


ACC 


TAT 


.AGT 


AAA 


TTT 


GAT 


ACG 


AAC 


AGC 


CAT 


AAC 


GAC 


GAT 




Ala 


Leu 


Leu Lys 


Asn Tyr 


Gly 


Leu 


Leu 


Tyr 


Cys 


Phe 


Arg 


Lys 


Asp 


Met 


Asp 


51 

121 


GCT 


CTA 


CTG 


AAA 


AAC 


TAT 


GGT 


CTG 


CTC 


TAC 


TGC 


TTC 


CGC 


AAG 


GAT 


ATG 


GAT 


68 


Lys 


val 


Glu 


Thr 


Phe 


Leu 


Arg 


He 


Val 


Gin 


Cys 


Arg 


Ser 


Val 


Glu 


Gly 


Ser 


172 


AAA 


GTT 


GAA 


ACC 


TTT 


CTG 


CGC 


ATA 


GTG 


CAG 


TGT 


CGA 


TCT 


GTG 


GAG 


GGC 


TCC 




Thr 


lie 


Pro 


Leu 


Ser Arg 


Leu 


Phe 


Asp 


Asn 


Ala 


Ser 


Leu 


Arg 


Ala 


His 


Arg 


85 

223 


ACT 


ATT 


CCT 


CTG 


TCC 


CGC 


TTG 


TTC 


GAC 


AAT 


GCC 


TCA 


TTA 


CGT 


GCA 


CAC 


AGA 




Leu 


His 


Gin 


Leu 


Ala 


Phe 


Asp 


Thr 


Tyr 


Gin 


Glu 


Phe 


Glu 


Glu 


Ala 


Tyr 


He 


102 
274 


TTG 


CAT 


CAG 


CTT 


GCC TTT 


GAT 


ACG 


TAC 


CAG 


GAG 


TTT 


GAA 


GAA 


GCG 


TAT 


ATT 




Pro 


Lys 


Glu Gin 


Lys 


Tyr 


Ser 


Phe 


Leu 


Gin 


Asn 


Pro 


Gin 


Thr 


Ser 


Leu 


Cys 


119 
325 


CCG 


AAG 


GAG 


CAA 


AAA 


TAC 


TCT 


TTT 


CTG 


CAA 


AAT 


CCG 


CAG 


ACC 


TCG 


CTG 


TGC 




Phe 


Ser 


Glu 


Ser 


He 


Pro 


Thr 


Pro 


Ser 


Asn 


Arg 


Glu 


Glu 


Thr 


Gin 


Gin 


Lys 


136 
376 


TTC 


AGT 


GAA AGC 


ATT 


CCG 


ACT 


CCA 


TCG 


AAC 


CGT 


GAG 


GAA 


ACA 


CAG 


CAG 


AAA 




Ser 


Asn 


Leu 


Glu 


Leu 


Leu 


Arg 


He 


Ser 


Leu 


Leu 


Leu 


He 


Gin 


Ser 


Trp 


Leu 


153 
427 


TCC 


AAT 


CTG 


GAA 


CTG 


CTT 


CGT 


ATC 


AGC 


TTA 


CTG 


CTC 


ATC 


CAA 


AGC 


TGG 


TTG 




Glu 


Pro 


Val 


Gin 


Phe 


Leu 


Arg 


Ser 


Val 


Phe 


Ala 


Asn 


Ser 


Leu 


val 


Tyr 


Gly 


170 
478 


GAA 


CCC 


GTC 


CAA 


TTC 


CTT 


CGT 


TCA 


GTG 


TTT 


GCG 


AAT 


AGT 


CTG 


GTT 


TAT 


GGG 




Ala 


Ser 


Asp 


Ser 


Asn 


Val 


Tyr 


Asp 


Leu 


Leu 


Lys 


Asp 


Leu 


Glu 


Stop 




184 
529 


GCA 


AGT 


GAC 


TCT 


AAC 


GTC 


TAT 


GAT 


CTG 


CTG 


AAA 


GAT 


CTC 


GAA 


TAA 







Example 4 



A further variation: Position cysteine residues at C and N termini- so that they may 
form a disulphide bond, thus making a covalently closed, circular molecule: 



CP_04 

MCIQTLMGRL EDGSPRTGQI FKQTYSKFDT NSHNDDALLK NYGLLYCFRK 
DMDKVETFLR IVQCRSVEGS T1PLSRLFDN ASLRAHRLHQ LAFDTYQEFE 
EAYIPKEQKY SFLQNPQTSL CFSESIPTPS NREETQQKSN LELLRISLLL 
IQSWLEPVQF LRSVFANSLV YGASDSNV YD LLKDLEC 

New cystein residues underlined. 

Encoded by: 

1 GAGGACTTAAATTAAATA 
Met Cys He Gin Thr Leu Met Gly Arg Leu Glu Asp Gly Ser Pro Arg Thr 

17 

19 ATG TGT ATC CAG ACC TTA ATG GGC CGC CTG GAA GAC GGT AGC CCG CGG ACG 
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Glv 


Gin 


He 


Phe 


Lys 


J* 












/ U 


on a 
oun 


CAA ATT 


TTf 


AAA 




Asp 


Asp Ala 


Leu 


Leu 


OX 












-L Z 1 


UAL 






Win 


CTG 






Asp 


LyS 


val 


Glu 


D O 












J. 




ui*l 


& & a 
AAA 


GTT 


GAA 






Ser 


Tnr 


He 


Pro 


ft Q 
















TCC 


ACT 


ATT 


CCT 




(lis 


Arg 


Leu 


His 


Gin 


JLU* 












274 


L-AC 


AGA 


TTG 


Lni 


PAR 






lie 


Pro 


Lys 


Glu 


1 1 o 

117 














Tan* 
1A1 


ATT 


CCG 


AAG 


GAG 




Leu 


Cys 


Phe 


Ser 


Glu 














-* /O 


CTG 


TGC 


TTC 


AGT 


GAA 




Gin 


Lys 


Ser 


Asn 


Leu 


153 












427 


CAG 


AAA 


TCC 


AAT 


CTG 




Trp 


Leu 


Glu 


Pro 


val 


170 












478 


TGG 


TTG 


GAA 


CCC 


GTC 




Tyr 


Gly Ala 


Ser Asp 


187 












529 


TAT 


GGG 


GCA 


AGT 


GAC 




Stop 








187 












580 


TAA 











Gin Thr Tyr 


Ser Lys 


Phe 


CAG 
Lys 


ACC TAT 
Asn Tyr 


AGT AAA 
Gly Leu 


TTT 
Leu 


AAA 

Thr 


AAC 
Phe 


TAT 
Leu 


Arg 


CTG 
He 


CTC 
val 


ACC 
Leu 


TTT CTG 
Ser Arg 


car 
Leu 


ATA 
Phe 


GTG 
Asp 


CTG 
Leu 


TCC 
Ala 


CGC 
Phe 


TTG 
Asp 


TTC 
Thr 


GAC 
Tyr 


CTT GCC TTT 
Gin Lys Tyr 


GAT 
Ser 


ACG 
Phe 


TAC 
Leu 


CAA 
Ser 


AAA 
He 


TAC 
Pro 


TCT 
Thr 


TTT 
Pro 


CTG 
Ser 


AGC 
Glu 


ATT 
Leu 


CCG 
Leu 


ACT 
Arg 


CCA 
He 


TCG 
ser 


GAA 
Gin 


CTG 
Phe 


CTT 
Leu 


CGT 
Arg 


ATC 
Ser 


AGC 
Val 


CAA 
Ser 


TTC 
Asn 


CTT 
Val 


CGT TCA 
Tyr Asp 


GTG 
Leu 


TCT 


AAC 


GTC 


TAT 


GAT 


CTG 



Asp 


Thr 


Asn 


Ser 


His 


Asn 


GAT 
Tyr 


ACG 
Cys 


AAC 
Phe 


AGC 
Arg 


CAT 
Lys 


AAC 
Asp 


TAC 
Gin 


TGC 
Cys 


TTC 
Arg 


CGC 
Ser 


AAG 
Val 


GAT 
GlU 


CAG 
Asn 


TGT 
Ala 


CGA 
Ser 


TCT 
Leu 


GTG 
Arg 


GAG 
Ala 


AAT 
Gin 


GCC 
Glu 


TCA 
Phe 


TTA 
Glu 


CGT 
Glu 


GCA 
Ala 


CAG 

Gin 


GAG 
Asn 


TTT 
Pro 


GAA 
Gin 


GAA 
Thr 


GCG 
Ser 


CAA 
Asn 


AAT 
Arg 


CCG 
Glu 


CAG 
Glu 


ACC 
Thr 


TCG 
Gin 


AAC 
Leu 


CGT 
Leu 


GAG 
Leu 


GAA 
He 


ACA 
Gin 


CAG. 
Ser 


TTA 
Phe 


CTG 
Ala 


CTC 
Asn 


ATC 
Ser 


CAA 
Leu 


AGC 
Val 


TTT 
Leu 


GCG 
Lys 


AAT 
Asp 


AGT 
Leu 


CTG 
Glu 


GTT 
Cys 


CTG 


AAA 


GAT 


CTC 


GAA 


TGT 
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